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© New expression control sequence. 



© New expression control sequences useful in the expres- 
sion of pro- or eukaryotic proteins in prokaryotic organisms 
are provided comprising a coliphage T5 promoter combined 
with a DNA sequence which permits the control of promoter 
activity. Also described are expression vectors containing 
these expression control sequences and processes using 
same for the manufacture of pro- or eukaryotic polypeptides. 
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The present invention relates to new expr es si on control 
ONA sequences, to expression vectors containing these 
DNA sequences, to host cells transformed with these ex- 
pression vectors, to methods tor the manufacture of said 
expression control DMA sequences, expression vectors and 
transformants, and to methods for producing pro- and 
eukaryotjc proteins by using the new expression control 
ONA sequences, vectors and transformants. 

The level of production of a protein in a host cefl is 
governed by three major factors: the number of copies of its 
gene within the cefl, the efficiency with which those gene 
copies are transcribed and the efficiency with which the 
resultant messenger RNA ("mRNA') is translated. Effi- 
ciency of tr an scr ipt ion and translation is in turn dependent 
upon the nucleotide sequences which are normally situated 
ahead of the desired coding sequence or gene. These 
nucleotide sequences (expression control DNA sequences) 
define, inter aBa. the location at which RNA polymerase 
interacts (the promoter sequence) to initiate transcription 
and at which ribosomes bind and interact with the mRNA 
(the product of transcription) to initiate translation. 

Not aJ such expression control DNA sequences func- 
tion with equal efficiency. It is thus often of advantage to 
separate the specific coding sequence or gene for a desired 
protein from its adjacent nucleotide sequences and to fuse it 
to other expression control DNA sequences so as to favor 
higher levels of expression. This having been achieved, the 
newly-engineered DNA fragment may be inserted into a 
higher copy number plasmid or a bacteriophage derivative 
in order to increase the number of gene copies within the 
cefl thereby improving the yield of the desired protein. 

Because over-production of even normally non-toxic 
gene products may be harmful to host ceQs and lead to 
decreased stability of particular host-vector systems, an 
expression control DNA sequence, in addition to improving 
the efficiency of transcription and translation of cloned 
genes, should be made controllable so as to allow modula- 
tion of expression during bacterial growth. For example, 
controllable expression control DNA sequences are ones 
that may be switched off to enable the host cells to propa- 
gate without excessive build-up of gene products and later 
be switched on to promote the expression of large amounts 
of the desired protein products, which are under the control 
of those expression control DNA sequences. 

Several expression control DNA sequences, which sat- 
isfy some of the criteria set forth above, have been em- 
ployed to express DNA sequences and genes coding for 
proteins and polypeptides in bacterial hosts. These include, 
for example, the operator, promoter and rfbosorne binding 
and interaction sequences of the lactose operon of EcoB 
(Ag., K. Itakura at aL. "Expression in Escherichia cofi of a 
chemically synthesized gene for the hormone somatostatin". 
Science. 198. pp. 1056-1063 [1977J D.V. Goeddel et aL. 
"Expression in Escherichia cofi of chemically synthesized 
genes for human insulin". PMAS USA 76. pp. 106-110 
[1979JJ. the corresponding sequences of the tryptophan 
synthetase system of E cofi (J.S. Emtage et aL. "Influenza 
antigenic determinants are expressed from Haemagglutinin 
genes cloned in Escherichia cofi", Nature, 283, pp. 
171-174 [1980J: J A. Martial et aL, "Human growth hor- 
mone: Complementary DNA cloning and expression in bac- 



teria". Science, 205, pp. 602-607 [1979]) and the major 
operator and promoter regions of phage X (H. Bernard et 
aL, "Construction of plasmid cloning vehicles that promote 
gene expression from the bacteriophage lambda P L prr> 

5 mater". Gene, 5. pp. 59-76 [1979]; European patent ap- 
plication, publication no. 41767). 

The present invention provides a novel and improved 
expression control DNA sequence comprising a art phage 
T5 promoter combined with a DNA sequence which permits 

io the control of promoter activity (regulating sequence). 

More precisely the present invention allows to combine 
a DNA sequence which permits the control of pr omot e r 
activity with a cofiphage T5 promoter while maintaining at 
the same time the highly efficient promoter function of the 

f5 art phage T5 promoter. 

In the invention T5 promoters are defined as premolar 
function mediating DNA sequences occurring in genomes of 
the cofiphage T5 family and functional combinations derived 
from such sequences. 

20 Naturally occurring T5 promoters are known to be 

efficient transcriptional initiation signals (A von Gabarn and 
H. Bujard, "Interaction of Ecofi RNA polymerase with pro- 
moters of cofiphage T5V Mofec gen. Genet, 157, pp. 
301-31 1 [1977]) with the following main properties: 

25 

(f) they exhibit a high forward rate constant in the complex 
formation between the promotar sequence and Ecofi RNA 
polymerase (A von Gabarn and H. Bujard, "Interaction of 
Escherichia cofi RNA polymerase with promoters of several 
30 cofiphage and plasmid DNAs". PNAS. 76. pp. 189-193 
[1979]); 

(a) they initiate RNA synthesis in vivo and in vitro with 
unusually high rates: 

35 

(5) they contain conserved sequences in 5 regions of the 
RNA polymerase binding site (H. Bujard et aL, "Integration 
of efficient promoters of the Ecofi system into plasmid 
vectors", in Gene Arrrptfficatron and Analysis, VoL 3: Ex- 
40 pression of cloned genes in prokaryotic and eukaryotjc cefls, 
80s. T.S. Papas. M. Rosenberg, and J.G. Crurikjian; El- 
sevier New York-AmstBfdam-Oxford, pp. 65-87 [1983]) and 

Ov) cloned in expression vehicles these promoters are not 
45 regulatabte. 



T5 promoters useful in the present invention are those 
of the "preearty" "early" and "late" expression class of the 
50 phage, especially the sequences described in the dissera- 
tion of R. Gentz. Urtversrtat Heidelberg, 1984: P m , P„ 
(named P»es « the present specification), P„, G». P^ 
(named P„ in Bujard et aL supra). P mb , G,„ G„ G„, 
G >H G„. 

55 The DNA sequence of some of the preferred T5 pro- 

moters mentioned above are indicated in Table I below: 



Table 1 

60 



65 
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Table i shows the nucleotide sequence of these four 
T5 promoters reacting rapidly with E-coB RNA polymerase 
Regions of major homology are boxed. Most striking are the 
homologies in -10. -33, -43, and +7 regions, as well as the 
precise spacing (18 bp) between -10 and -33 homologies. 
The underlined sequences centering around -43 indicate 
regions where a strong selection against GC base pairs 
occurs. 

DNA sequences which permit the control of the pro- 
moter activity are known in the art They occur in operons 
(reguiatabie expression unit) of various types (JJ-i. Miller 
. and W.S. Reznikoff, "The operon", Cold Spring Harbor 
Laboratory [1980]). One such type is a negatively con- 
trolled expression unit whose essential element is an oper- 
ator sequence which interacts with a repressor protein. The 
coding sequence for the repressor protein can be located 
on a plasmid carrying the operator, or on a separate vector 
or in the host chromosome. 

Preferred DNA sequences which permit the control of 
the promoter activity, used in this invention, are 
operator/represser systems. Especially preferred is the sys- 
tem of the lac operon consisting of the nature lac operator 
or functional sequences derived therefrom and repressor 
encrxSng DNA sequences producing natural or modified 
repressor molecules (J.H. MiDer and W.S. Reznikoff. supra). 

The expression control DNA sequences of the present 
invention can be obtained as follows: 

A coliphage T5 promoter sequence can be combined 
in a manner weft known in the art with one or more 
sequence^) which permit the control of promoter activity 
(regulating sequence) to a functional unit whereby the posi- 
tion of the regulating sequences e.g., an operator can vary. 
In such combinations the regulating sequence can be posi- 
tioned within or outside the T5 promoter sequence. Thus, a 
regulating sequence may be integrated within the promoter, 
may be partially overlapping with or may proceed or follow 
the promoter at various distances without any overlap. Pref- 
erably the regulating sequence overlaps the promoter se- 
quence between position + 1 and +20, -13 and -30 and/or 
-34 and -50 (nomenclature as in table I). Further preferred 
positions are regions "downstream" of position +20 up to 
+200. Especially preferred sites are those where the regu- 
lating sequence e.g., the lac operator sequence, overlaps 
the T5 promoter sequence until position + 2 in analogy to 
the lac operon, in Exoli, as shown in Rg. 7. 

The expression control DNA sequence of the present 
invention can be obtained in accordance with methods 
well-known in DNA chemistry including total chemical syn- 
thesis of the respective DNA sequence. 

In me preferred embodiment of this invention the lac 
operator can be negatively regulated by the lac repressor 
protein which is coded by the lac l-gene. To obtain proper 
amounts of repressor molecules inside the cell the cor- 



responding gene can be overexpressed by conventional 
methods, for example by integrating the lac V* gene (J.H. 
Milter and W.S. Reznikoff, supra; M.P. Calos, "DNA se- 
quence of a tow-level promoter of the lac repressor gene 

20 and an 'up' promoter mutation, Nature, 274, pp. 762-765 
[1978]) into a vector comprising the above expression con- 
trol sequence. 

The expression sequence of this invention can be 
introduced in any convenient expression vector of plasmid 

25 or phage origin in a manner known per se. Convenient 
expression vectors of plasmid or phage origin are mentioned 
e.g.. in the laboratory manual "Molecular Cloning" by Man- 
iatis et aL, Cold Spring Harbor Laboratory, 1982. 

Members of the pDSl -family of plasm ids were shown 

30 to be suitable for the stable integration of transcriptional 
signals of exceptional strength (D. Stuber and H. Bujard, 
"Transcription from efficient promoters can interfere with 
plasmid replication and diminish expression of plasmid 
specified genes". The EM BO Journal. 1. pp. 1399-1404 

35 [1982]). 

Therefore the preferred vectors used in this invention 
are those of the pDSi family. These piasmids are derived 
from plasmid pBR322 and comprise ag., pDSl.tol* and 
pDSl. to 2* resulting in plasmid constructions Eke 

40 pDS2/P N25 x/ 0 ,t o r and pDS3/PM25X/o.to2*. 

E.coli strains containing piasmids useful for such con- 
structions (Exoli M15 transformed with pDSl.t 0 1*; 
pDSi.toT; pDSIX.1) were deposited at Deutsche Samm- 
tung von Mikroorganismen (DSM) in Gottingen on Decern- 

« ber 11. 1984, the accession nos. being DSM 3135. DSM 
3136. DSM 3137 respectively. 

The DNA sequences that may be expressed by the 
expression vectors of this invention may be selected from a 
large variety of DNA sequences that encode prokaryotic or 

so eukaryotic polypeptides in vivo or in vitro. For example, 
such sequences may encode enzymes, hormones, polypep- 
tides with immuno-modulatory, anti-viral or anti-cancer prop- 
erties, antibodies, antigens, vaccines and other useful poly- 
peptides of prokaryotic or eukaryotic origin. 

55 Examples of proteins which can be expressed by using 

the improved expression control system of the present in- 
vention are oThydrofolate reductase, chloramphenicol acetyl- 
transferase, malaria surface antigens, rymphokms fike IL-2. 
interferons a, (3 and y, insulin and insulin precursors, growth 

60 hormones, tissue plasminogen activator or human renin. 

Methods for expressing DNA sequences coding for 
prokaryotic or eukaryotic proteins using the expression vec- 
tors of this invention are well known (Maniatis et al., supra). 
They include transforming an appropriate host with the 

65 expression vector having the desired DNA sequence oper- 
atively linked to the expression control sequence of the 
vector, culturing the host under appropriate conditions of 
growth and isolating the desired polypeptide from the cul- 
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lure. Those of ska in the art may select from these known 
methods those that are most effective for a particular gene 
expression without departing from the scope of this inven- 
tion. 

The selection of a particular host for use in this inven- 
tion is dependent upon a number of factors recognized by 
the art These include, for example, compatibility with the 
chosen expression vector, toxicity of the proteins encoded 
for by the hybrid ptasrrtid. ease of recovery of the desired 
protein, expression characteristics, riosafety and costs. 
Within these general guidelines, examples of useful bacterial 
hosts are gram-negative and gram-positive bacteria, espe- 
cially strains of Exoli and B-subtffis. The most preferred 
host cell of this invention is Exofi M15 (described as 
DZ291 in M.R. Vfflarejo et aL, "fl-Gatactosidase from ter- 
mination and deletion mutant strains", J. Bacterid., 120, pp. 
466-474 [1974]). However other Exoli strains such as 
Exofi 294 (ATCC No. 31448) and Exoli RRl (ATCC No. 
31343) can also be used. 

The present invention will be better understood on the 
basis of the following examples when considered in connec- 
tion with the following figures: 

. The abbreviations and symbols used are B. E, H, P, X 
and Xb which indicate sites for restriction endonucteases 
Baml EcoRI. Hindlll. Pstl, Xhol an d Xbal. respectively. 
^•"'"•"'"••^ repre senls promoters; t^^^ re p res e nt s rfbo somal 
binding sitesjmmjuii represents terminator to ;ES S3 repre- 
sents lac operator (O) or parts of it CZZD represents 
promoter Pn25 (Pjcs); *~ > represents region required for 
DMA -replication (repL);MM» represents coding region ei- 
ther for ajhydrofatat reductase (dhfr), chloramphenicol ac- 
etyttransferase (cat), ^-lactamase (Wa) and tac repressor 
(tecl) or for parts of th e codin g region for cNorarrqjhenicol 
acetyl transferase (caf);VBM represents parts of the cod- 
ing region for fl-oalactosidase (lac Z'). 

Figure i. Part a) is a schematic drawing of the piasmtd 
pDSl.tol*. The nucleotide sequence of the 
EcoRl/Xbal -fragment containing the dhfr-gene, terminator to 
and the cat-gene is displayed in part b). Here the restriction 
endonucfease sites indicated in part a) are cverlined. In 
addition the pBR322 entity of pDSi.vi* is schematically 
shown, where the given numbers refer to the nucleotide 
sequence of pBR322 (J.G- Sutcfiffe. "Complete nucleotide 
sequence of the Escherichia cofi plasmid pBR322", Cold 
Spring Harbor Symp. Quant BfoL, 43, pp. 77-90 [1979]. 

Figure 2 is a schematic drawing of the ptasmid 
pDSl/P N25 ,t 0 r. The nucleotide sequence of the Xhol- 
fragment carrying promoter Pkzs is displayed. The site of 
initiation and the dir ec ti on of transcription are indicated by 
the arrow. 

Figure 3 is a schematic drawing of the ptasmid pEX07 lac 
op 2. The nucleotide sequence of the 
EcoRI/Hindlll-fragment carrying the tac operator is dis- 
played. 

Figure 4. Part a) is a schematic drawing of the plasmid 
pDSi.to2*. The nucleotide sequence of the 
Xhol/Xbal-fragment containing the dhfr-gene, the cat-gene 
and terminator to is displayed in part b). Here the restriction 
endonuctease sites indicated in part a) are overfined. In 
addition the pBR322 entity of pDSl.toT a schematically 
shown, where the given numbers refer to the nucleotide 
sequence of pBR322 (J.Q, Sutcfiffe, supra). 

Figure 5. Part a) is a schematic drawing of the ptasmid 



pDSrx.1. The nucleotide sequence of the 
EcoRI/Xbal-fragment containing the lac 1-gene, parts of the 
lac Z-gene (lac T) and the cat-gene is displayed in parts b) 
and c). Here the restriction endonuctease sites indicated in 
5 part a) are overfined. In addition the pBR322 entity of 
pDSTX.1 is schematically shown, where the given numbers 
refer to the nucleotide sequence of pBR322 (J.G. Sutcfiffe, 
supra). 

io Figure 6a.b,c is a schematic outline of the construction of 
the promoter/operator fusion Pics^o embedded in the ptas- 
mid pOS2/P N25*o,to2* . 

Figure 7 dtsplayes the nucleotide sequence of the 
TS XhoI/EcoRI fragment containing the promoter/operator fu- 
sion P1Q5X/0. The site of initiation and the erection of 
transcription are indicated by the arrow. 

Figure SaJxc is a schematic outrun of the construction of 
20 plasmids pDSXIU, pDS3/Pn25XA),to 2* and pDSXIII.i. 

Figure 9 is an etectropherograrn monitoring protein synthe- 
sis in the absence or presence of IPTQ in ExoG M15 
harboring plasmids pDSXJI.3 (lane 1). pOS2/Pk2s xyo,to2* 
25 (lanes 2 and 3) or pDS3/P N2sx/o.to2* (fanes 4-7). 



The construction of the promoter/operator fusion PN 
tesx/g of this invention is described more in detail in the 
30 following examples 

Example 1 



35 Descr ip t i on of plasmids used for the construction of pro- 
moter Pies X/o 



A. Principles 

40 

pOSI. 1 0 1* (figure 1) and pOSl. WT (figure 4) were 
chosen as vectors not only for the in tegratio n of promoter 
Pics and the construction of the fusion between promoter 
Prcsand the tac operator but also for the d em ons trati on of 
45 the function of the resulting promoter/operator fusion 
Piesfto- 

ARhough there are several well defined sources for 
promoter P nzs. plasmid pOSI/P^vi* (figure 2) carrying 
this element on an EcoRI-fragment was chosen as source 

50 for promoter Pn25 whereas pEX07 lac op 2 (figure 3; H. 
Weiher, "Untersuchungen 2ur Struktur und Funktion von 
Exofi Promotoren: Variation des Lac Promoters durch 
geziette Neukonstruktion von Tetfsequenzen und 
Mutagenese", PhD. thesis. University of Heidelberg, FRG 

55 [1980D (XXTiprising 21 basepairs of the fac operator was 
chosen to provide the operator entity in the 
promoter/operator fusion. The lac repressor is encoded by 
the lac I gene. Since a promoter is efficiently repressed by 
binding of repressor to the operator only if sufficient 

60 amounts of repressor molecules are present the lac l q aDel 
was used with a mutant promoter resulting in an increased 
transcription of the gene to provide enough repressor mol- 
ecules. As source of this mutant lac I gene plasmid plX.i 
was chosen (figure 5). 

65 
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B. Plasmid pDSi.vr 

The part of pDSl.toi* (figure 1; 0. S tuber and H. 
Bujard. supra) between the sites for restriction en- 
donucleases Xbal and EcoRI containing the region required 
for DMA replication and maintenance of the plasmid in the 
cell and the entire gene for /S-lactamase conferring resis- 
tance to amptcflSn is pBR322 derived (F. Bolivar et at, 
"Construction and characterization of new cloning vehicles 
II. A multi-purpose cloning system", Gene, 2. pp. 95-113 
[1977]; J.G. Sutdiffe, supra). The remaining part of the 
plasmid carries sites for restriction endonucteases Xhol, 
EcoRI and BamHI followed by the coding region for 
dirrydrofotate reductase of mouse AT-3000 ceD line (A.C.Y. 
Chang et aL, "Phsnotypic expression in E coB of a DMA 
sequence coding for mouse dihyrjrofolate reductase", Na- 
ture. 275, pp. 617-624 [1978]; J.N. Masters and G. Attardi, 
"The nucleotide sequence of the cONA coding for the 
human dihycfrofolic acid reductase", Gene, 21, pp. 59-63 
[1983]), by the terminator to of phage lambda (E. Schwarz 
et at, "Nucleotide sequence of cro, ctl and part of the O 
gene in phage X DNA". Nature, 272, pp. 410-414 [1978]; 
D. S tuber and H. Bujard, supra) and by the promoter-free 
gene for chloramphenicol acetyltransferase (R. Marcoli et 
at, "The DNA sequence of an IS1 -flanked transposon 
coding for resistance to chloramphenicol and fusidtc acid", 
FEBS Letters. 110, pp. 11-14 [1980p. 

C. Plasmid pDSl/Pnes ,tol* 

Plasmid pDSl/Pies.tor (figure 2) corresponds to 
pDSi,tbi* (figure 1) but contains on an Xhrjl-fragment 
promoter Pks of Exoli phage T5 (A. von Gabain and H. 
Bujard, supra; D. Stuber et at, "Electron microscopic analy- 
sis of in vitro transcriptional complexes: Mapping of promot- 
ers of the cofiphage T5 genome". Molec. gen. Gen.. 168. 
pp. 141-149 [1978]; H. Bujard et aL, supra). 

D. Plasmid pEX07 lac op 2 

Plasmid pEX07 lac op 2 (figure 3; H. Weiher, supra) is 
a derivative of pBR322 (F. Bolivar et aL, supra; J.G. 
Sutdiffe, supra) where the EcoRVHindlll-fragment of 
pBR322 is replaced by an EcoRI/Hindlll-fragment contain- 
ing 21 basepairs of the lac operator sequence (J.H. Miller 
and W.S. Reznikoff, supra). 

E. Plasmid pDSl.to^ 

The part of pOSl.to2* (figure 4; D. Stuber and H. 
Bujard, supra) between the sites for restriction en- 
donucteases Xbal and Xhol containing the region required 
for DNA replication and maintenance of the plasmid in the 
cell and the entire gene for -lactamase centering resis- 
tance to ampicinrn is pBR322 derived (F. Bolivar et at. 
supra; J.G. SuteBffe, supra). The remaining part of the 
plasmid carries sites for restriction endonucteases EcoRI 
and BamHI followed by the coding region for dihydrofolate 
reductase of mouse AT-3000 ceD line (A.C.Y. Chang et at, 
supra; J.N. Masters and G. Attardi, supra), by the 
promoter-free gene far chloramphenicol acetyltransferase 
(R. Marcoli et at, supra) and by the terminator to of phage 
lambda (E. Schwarz et at, supra). 



F. Plasmid pDSIX.1 

Plasmid pDSIX.1 (figure 5) carries the identical part of 
PBR322 as plasmid pDSi.tol* (figure 1). In addition it 

5 contains the entire gene for the lac repressor (P.J. 
Farabaugh. "Sequence of the lad gene". Nature. 274. pp. 
765-769 [1978]) with the promoter mutation l<> {MP. Catos, 
supra) followed by sites for restriction endonucteases EcoRI 
and Xhol, by the regulatory and part of the coding region of 

io the gene for fl-galactosidase (A. Kalnins et at, "Sequence 
of the lac Z gene of Escherichia coli". The EMBO J.. 2, pp. 
593-597 [1983 J; R. Gentz et at, "Cloning and analysis of 
strong promoters is made possible by the downstream 
placement of a RNA termination signal", PNAS. 78, pp. 

JS 4936-4940 [1981]) and by the promoter-free gene for 
chlorarnrjhenicol acetyltransferase (R. Marcoli et at, supra). 

Example 2 

20 

Construction of the plasmid pDs2/P N25 x«),t 0 2* carrying the 
promoter/operator fusion Pms x/o 

The promoter/operator fusion PnkX/o was prepared 
25 and embedded in pDSLto 2* to give pDS2/PN25*o,to 2* in 
a sequence of steps. These are depicted in figure 6 and 
more fully described below. 

A. Preparation of the promoter entity of the 
30 promoter/operator fusion 



a) Isolation of the EcrjRI-fragrnent containing Ptes 

35 To isolate the EcoRI -fragment carrying promoter Pta& 325 
pg of plasmid pOSi/Pn^toi* were digested with EcoRI 
(figure 6a) and after extraction with phenol followed by 
treatment with ether and subsequent precipitation with 
ethanol the DNA fragments were separated by etec- 

40 trophoresis in a 6% potyawyiamid gel. Following visualiza- 
tion of the DNA by staining with etrtirJum bromide, the 
fragment carrying Pkes was cut out of the gel and eluted by 
shaking the mashed gel slice for 10 hours in a buffer 
containmg 2-5 mM TrisHCI (pH 7£) and 0.25 mM EDTA. 

45 After removal of the gel pieces by centrifugation, the result- 
ing supernatant was extracted with phenol, treated with 
ether and the DNA fragment collected by precipitation with 
ethanoL 1.3 ug of the rjromoter-fragment were isolated in 
this way. 

50 

b) Limited digestion of the EcoRI-fragment with Hinfl 

i pg of the EcoRI-fragment was incubated in a volume of 
120 ul in a buffer containing 100 mM sodium chloride with 

55 21 units Hinfl for 9 minutes at 37°C. The enzyme was 
inactivated by incubation of the mixture for 7 minutes at 
65'C and withdrawn by three extractions with phenol. After 
removal of the residual phenol by extraction with ether the 
DNA was precipitated with ethanol and stored in a buffer 

60 containing 5 mM Tris/HCI (pH 7.6), 0.5 mM EDTA and 1 00 
mM sodium chloride. Analysis of the DNA by elec- 
trophoresis in a 6% polyacrytamid gel revealed that about 
60% of the EcoRI-fragment molecules had been cleaved by 
Hinfl. 

65 

c) Extending the recessed 3"-ends with DNA polymerase I 
0.17 ug of the Hinfl digested DNA was incubated in a 
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volume of 25 id in a buffer consisting of 100 mM potassium 
phosphate (pH 7.0), 100 mM potassium chloride, 7 mM 
magnesium chloride. 25 uM each of the four 
deoxynuciBOSide- triphosphates with 5 units of DMA poly- 
merase I for 40 minutes at 20°C. After removal of the 
enzyme by extraction with phenol, the solution was 
cruornatographed through Sephadex Q75 equifibrated in a 
buffer of pH 7.7 consisting of 2.5 mM Tris, 0.25 mM EDTA 
and 50 mM potassium acetate. DNA containing fractions 
were combined before the DNA was precipitated with 
ethanol and stored in ligation buffer supplemented with 50 
mM sodium chloride. The resulting solution with 0.08 ug 
DNA contained fragment 1 (figure 6a). 



B. Preparation of the operator entity of the 
promoter/operator fusion 

a) Limited digestion of pEX07 lac Op 2 

14.5 ug of ptasmid pEX07 lac op 2 (figure 6b) were 
incubated in a volume of 250 ul with 1.5 units EcoRI for 50 
minutes at 37 *C. The enzyme was inactivated by incuba- 
tion for 7 minutes at 65 *C and withdrawn by extraction with 
phenol After removal of residual phenol with ether the DNA 
was precipitated with ethanol and stored in a buffer consist- 
ing of 23 mM Tris/HCI (pH 7.6) and 0.25 mM EDTA. 
Analysis of the DNA by electrophoresis in a 6% 
polyacrylamid gel revealed that about 50% of the ptasmid 
molecules had been linearized. 

b) Removal of the 5'-protruding ends with S1 nuclease To 
remove the extended 5'-ends, the EcoRI digested DNA was 
treated with the single-strand specific nuclease Si (figure 
6b). In three separate experiments 1.7 ug DNA were in- 
cubated in a volume of 50 ul in a buffer containing 280 mM 
sodium chloride, 30 mM sodium acetate (pH 4.4). 4.5 mM 
zinc acetate and 20 ug Art single-stranded DNA of phage 
W with 160 units. 800 units and 4000 units Si nuclease 
(Boehrfnger, Mannheim, FRG) for 30 minutes at 20 °C. The 
reactions were stopped by adding ammonium acetate and 
EDTA to final concentrations of 0.5 M and 10 mM. respec- 
tively, and incubation for 7 minutes at 65 *C. After removal 
of the protein by extraction with phenol followed by treat- 
ment with ether, the three mixtures were combined and the 
resulting solution chromatographed through Sephadex G75 
equifibrated in a buffer of pH 7.0 consisting of 23 mM 
Tris/HCI, 0.25 mM EDTA and 50 mM potassium acetate. 
DNA containing fractions were combined before the DNA 
was precipitated with ethanol and stored in Ogata! buffer. 
The resulting solution with 4.3 ug DNA contained fragment 
2 (figure 6b). 



C. Preparation of plasmid pDSl,to2* accepting the 
promoter/operator fusion 

pOSl,to2* (figure 4) was chosen as vector to accept the 
prwnotar/operator fusion. Therefore. 5.1 pg DNA of this 
plasmid were digested to completion with restriction en- 
oonucteases Hindlll and Xhol (figure 6c). After inactivaton 
of the enzymes by incubation for 7 minutes at 65°C and 
removal of fne protein by extraction with phenol followed by 
treatment with ether, the DNA was precipitated with ethanol 
and stored in a buffer consisting of 25 mM Tris/HCI (pH 
7.6) and 0.25 mM EDTA 
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D. Construction of pDS2/P tasfaMZ" 

To construct vDSZJPkz&k,^ the promoter entity 
(fragment 1) and the operator entity (fragment 2) of P 

5 N25X«were Kgated together and the resulting fusion subse- 
quently embedded in pDSl,to2* (figure 6c). For the fusion 
of promoter and operator 0.05 pg of the DNA mixture 
containing fragment 1 and 3.5 pg of the DNA mixture 
containing fragment 2 were incubated in ligation buffer 

ro supplemented with 25 mM sodium chloride with 5 units 
T4-DNA Dgase for 7 hours at 15°C. After irtactivation of the 
enzyme by incubation for 7 minutes at 65°C. the ligation 
mixture was incubated in a volume of 100 ul with 20 units 
Hindlll and 15 units Xhol for 1 hour at 37 0 C to obtain the 

75 promoter/operator fusion as an Xhol/Hindlll fragment After 
inactrvaSon of the restriction endonucleases at 65 "C and 
their removal by extraction with phenol followed by treat- 
ment with ether, the DNA was precipitated with ethanol and 
redissotvad in ligation buffer supplemented with 25 mM 

20 potassium chloride. 0.25 pg plasmid pDSl,1o2* digested 
with Hindlll as well as Xhol and 1.3 units T4-DNA flgase 
were added and the mixture incubated for 4 hours at 15*C 
before the enzyme was inactivated by incubation for 7 
minutes at 65°C. 

25 

Ecofi C600 (CaCl, competent) was transformed with half 
of the above prepared ligation mixture under appropriate 
conditions. Transformants were selected at 37»C on Mini- 
mal Agar Plates based on M9-medium (J.H. Mffler, 

30 "Experiments In Molecular Genetics". Cold Spring Harbor 
laboratory, pp. 431-432 [1972]) supplemented with 0.2% 
glucose, 05% casein nvdrotysate, 10 mg/Gter vitamin B1, 
40 mg/Bter x-gal ( 

5-bromo-4-criloro-3-irtoY)lyl-fl-D-galacto and 10-200 

35 ug/ml chtoramphenicoL 

Only transforrnants containing plasmid pDSl.t o2* with a 
promoter integrated in front of the promoter-free gene for 
chloramphenicol acetyttransferase were expected to grow 

40 on plates with concentrations of chldramphenicol of more 
than 10 ug/ml. Furthermore, transforrnants containing 
pDSl.to2* with PussXn integrated in front of this indicator 
gene were expected to become blue on these plates, due to 
the induction of the host lac system by binding of the lac 

45 repressor molecules to the operator sequence present in the 
plasmids and subsequent cleavage of the colourless X-gal 
to a blue derivative. Therefore. 18 blue transforrnants from 
plates with 50 ug/ml or higher concentrations of chloram- 
phenicol were selected and cultures grown at 37 "C in 

50 IB-medium containing 100 ug/ml ampfcfflin. DNA from 
these cultures was isolated using standard procedures and 
analyzed for its size and the presence of sites for restriction 
endonuclBases EcoRI, Hindlll. Hinfl and Xhol. One plasmid 
displaying the expected patterns after electrophoresis of 

55 restriction fragments in 6% polyacrylamid gels was des- 
ignated PDS2/PN2SX4) ,to2* (figure 6c). Sequence analysis 
of the Hindlll-Xhoi fragment of this plasmid confirmed the 
expected sequence for promoter Pnzskq as depicted in 
figure 7. 

60 
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Construction of ptasmkls pOSXII.3, pOS3.Pn2sx«.tD2* and 
pOSXlll.1 containing promoter Pte&m 

In addition to pDS2/P N 2sXa,t 0 2* (lacking repressor 
gene) three more plasmas containing the promoter/operator 
fusion P N25*o, namely pDSXII.3 fintermediate for the man- 
ufacture of pDS Xlll.l), pDS3/Pic5X /o,to2* (as an example 
of plasmids according to the invention) and pOSXIH.1 (with 
a partly deleted cat-gene lacking RBS; as a negative con- 
trol for comparative purposes), were designed to dem- 
onstrate the function of this element The construction of 
these plasmids is depicted in figure 8 and more fully de- 
scribed below. 

A. Construction of plasmid pDSXII.3 



Plasmid pOSXII,3 was derived from pDS2/P ne5M>,to2* by 
eliminating the EcoRI-fragment of this plasmid containing 
the single site for restriction endonucteases Hindlll and part 
of the gene for chloramphenicol acetyltransferase (figure 
8a). For that purpose 0.45 pg of pDS2/Pu2s x^toT were 
digested to completion with restriction endonuclease EcoRI. 
After removal of the enzyme by extraction with phenol, the 
solution was chromatographed through Sephadex Q75 
equilibrated in a buffer of pH 7.6 consisting of 2.5 mM 
Tris/HCI and 0.25 mM EDTA. DMA containing fractions 
were combined and the resulting solution was concentrated 
by lyopnifization. 0.13 ug of the plasmid digested with EcoRI 
were incubated in a volume of 100 ul in ligation buffer with 
1.3 units T4-DNA figase for 5 hours at 15°C before the 
enzyme was inactivated by incubation for 7 minutes at 
65*C- 

Exoti M15 (CaCi, competent) was transformed with 0.07 
ug of the Ggated DNA under appropriate conditions. Trans- 
formants were selected at 37°C on Minimal- Agar plates 
based on Md-medium (Jit Miller, supra) supplemented 
with 0.2% glucose, 0.5% casein hydrotysate, 10 mg/lter 
vitamin Bl, 40 mg/liter X-gal and i mM IPTG 
(Isopropyrthiogalactoside) and containing either 100 ug/ml 
ampicinin or 50 ug/ml chloramphenicol. 

Since loss of the EcoRI-fragment comprising part of the 
gene for chloramphenicol acetyttransferase results in loss of 
resistance to chloramphenicol, plates with amptcillin were 
found to have about 40 times more transformants than 
plates with ampidffin. 4 transformants resistant to amptcilltn 
were selected and cultures grown in LB-medium containg 
100 ug/ml ampicfflia DMA from these cultures was isolated 
using standard procedures and analyzed for its size and 
presence of sites for restriction endonucteases EcoRI. Hin- 
dlll. Hinfl. Psti and Xhol. One plasmid displaying the ex- 
pected patterns after electrophoresis in 6% poryacrytamtd- 
gels was designated pDSXII.3 (figure 8a). 



B. Construction of plasmids pDS3/P N2 5X rt> ,t 0 2* and 
pDSXIII.1 

Plasmids pDS3/P N 2 S * 0 ,t o 2* and pDSXIII,1 were derived by 
combining appropriate fragments of plasmids pDSIX.1 and 
pDS2/P N25 xo.to2* (figure 8b) or pDSXII,3 (figure 8c). re- 
spectively. For that purpose, 1.25 pg of these three plas- 
mids were digested in three separate reactions to comple- 
tion with restriction endonucteases Psti and Xhol. After 
removal of these enzymes by extraction with phenol, the 
three solutions were chromatographed through Sephadex 



G75 equilibrated in a buffer of pH 7.6 consisting of 25 mM 
Tris/HCI and 0.25 mM EDTA. DNA containing fragments of 
each experiment were combined and the resulting solutions 
c on c en trated by fyophilizatiort 

5 

In two separate reactions 0.13 ug digested DNA of each of 
the plasmids pDSIX,1 and pDS2/Pne5XA),io2* or pDSIX.1 
and pDSXII,3 were incubated in a volume of 60 ul in 
ligation buffer with 1.3 units T4-DNA ligase for 10 hours at 

70 15"C before the enzyme was inactivated by incubation for 
7 minutes at 65 °C. Again in two separate reactions E.coli 
Mi 5 (CaCI, competent) was transformed with 0.12 ug of 
the tigated DNA under appropriate conditions. Transfor- 
mants were selected at 37°C on Minimal- Agar plates based 

75 on M9-medium (J.H. Miller, supra) supplemented with 0.2% 
glucose, 0.5% casein hydrotysate, 10 mg/fiter vitamin Bt, 
40 mg/rrter X-gal and 1 mM IPTG and containing 100 ug/ml 
ampscfilin. Six white transformants of each transformation 
were selected and cultures grown in LB-medium containing 

20 100 ug/ml amptcillin. DNA of these cultures was isolated 
using standard procedures and analyzed for its size and 
presence of sites for restriction endonucteases EcoRI. Hin- 
dlll. Hinfl. Psti and Xhol. Two plasmids derived from 
pDS2/Pi e5 X/o.t 0 2* and pDSXII.3 displaying the expected 

25 patterns after electrophoresis in 6% potyacrylamid gels 
were designated pDS3/PN25*rj,to2* (figure 8b) and 
pDSXiil.1 (figure 8c), respectively. 



30 

Example 4 



Expression of chloramphenicol acetyttransferase in plamids 
35 containing promoter/operator fusion Pnes X/o 

To demonstrate the usefulness of promoter/operator 
fusion Pn2sx«, the synthesis of chloramphenicol acetyttrans- 
ferase in cells harboring plasmids containing P tesxiowas 

40 monitored. Ecofi M15 cells transformed with either 
pDS2/PN25*o.to2* . pDSXIII.1 or pDS3rPnasX(0, to2* were 
grown in the presence or absence of 1 mM IPTG in 
LB-medium containing 100 ug/ml ampidlBn at 37 "C to a 
density of i.yioa cells/ml 1.5- 10 s cells were collected by 

45 centrifugation and resuspended in sample buffer containing 
1% SDS, 1% /3-mercaptoethanol, 10% glycerol and 62.5 
mM Tris/HCI (pH 6.8). Samples were boiled for 5 minutes, 
chilled on ice, centrifuged at 12 000 xg for 30 seconds and 
electrophoresed in a SDS-containing potyacrylamide gel 

so (125% acryiamide) according to the procedure of U. Laem- 
mfi, 'Cleavage of structural proteins during the assembly of 
the head of Bacteriophage T4", Nature, 227, pp. 680-682 
[1970]. After staining of the proteins with Coomassie brillarrt 
Blue R-250 the unbound dye was removed from the get A 

55 fotograph of this gel is shown in figure 9- 

Comparing now lanes 1, 2 and 3 of figure 9 it be- 
comes obvious that as expected pDSXill.l (lane 1) encodes 
tec repressor (R) but not tfiirjramprtenicol acetyltransferase 
(CAT) whereas in cells transformed with pDS2/Pt!C5*o,t <> 2* 

60 (lanes 2 and 3) the repressor is absent and chloram- 
phenicol acetyltransferase is produced in large amounts 
independent of the presence of IPTG in the medium. Al- 
though pDS3/PN25X*),to2* differs from pDS2/pN25X/o.to2* 
only in comprising the gene for the lac repressor (figure 8b), 

65 in cells transformed with pOS3/PN25X/o,to2* only unviable 
amounts of chloramphenicol acetyltransferase are produced 
in the absence of IPTG (compare lanes 4 and 5 with lanes 
1, 2 and 3). However, in cells harboring pDS3/PN25*o,to2* 
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and grown in the presence of IPTG essentially the same 
amount of chloramphenicol acstyttransferase is produced as 
in cells containing pDS2/P Hzsxn,^ {compare lanes 6 
and 7 with lanes 2 and 3). 

The results described above show, that promoter 
PiesKo is repressed in the presence of lac repressor and is 
induced essentially to full activity by addition of the inducer 
IPTG. 

Claims 



1. An expression control ONA sequence compris in g a col- 
iphage T5 promoter combined with a DNA sequence which 
permits the control of promoter activity. - 

2. An expression control DNA sequence according to claim 
1. wherein said coBphage T5 promoter is the P«s pro- 
moter. 

3. An expression control DNA sequence according to claim 
lor 2 wherein the DNA sequence which permits the 
control of promoter activity is an operator. 

4. An expression control DNA sequence according to claim 
1, 2 or 3 wherein said operator is the lac operator or a 
functional part thereof. 

5. An expression control DNA sequence according to claim 
4. wherein the lac operator or a functional part thereof 
overlaps the T5 promoter sequence. 

6. An expression control DNA sequence according to any 
one of claims 1 to 5 comprising the functional part of the 
nucleotide sequence of fig. 7. 

7. An expression vector comprising an expression control 
DNA sequence according to claims 1 to 6. 

8. An expression vector according to claim 7 which is a 
plasmid capable of replication in gram-negative and/or 
gram-positive bacteria. 

9. An expression vector according to claim 8 which is 
capable of replication in an ExoG strata 

10. An expression vector according to claim 8 which is 
capable of replication in a B-suotflis strain. 

11. An expression vector according to claim 8 or 9 which is 
a member of the pDS 1 plasmid family. 

12. An expression vector of claim 11 which rs pDS2/P 

N25X/D.to2*. 

13. An expression vector of claim 11 which is pDS3/Pn2sx/o 
ito2*. 

14. A trartsformarrt carrying an expression vector claimed in 
any one of daims 7 to 13. 

15. A transformant according to claim 14 which is an E.cofi 
strain. 

16. A transformant according to claim 15 which is an Exofi 
M15 strain. 

17. A transformant according to claim 14 which is a 



B.subtifis strain. 

18. A process for the manufacture of a pro- or eukaryotic 
polypeptide which process comprises transforming a host 

5 with an expression vector axnaining the DNA sequence 
coding for said polypeptide operativery linked to an expres- 
sion control DNA sequence as claimed in any one of claims 
i to 6, cufturing the transformant under appropriate con- 
ditions of growth and isolating (he desired polypeptide from 

10 the culture. 

19. A process according to claim 18 wherein the vector is a 
plasmid capable of replication in gram-nega + tive and/or 
gram-positive bacteria. 

75 

20. A process according to claim 19 wherein the plasmid is 
capable of replication in an E.cdi strata 

21. A process according to claim 19 wherein the plasmid is 
20 capable of replication in a B-subtrSs strata 

22. A process according to claim 19 or 20 wherein the 
plasmid is a member of the pDSl family. 

25 23. A process according to claim 22 wherein the plasmid is 
pDS2/P N2s x /0 .to2\ 

24. A process according to claim 22 wherein the plasmid is 
PDS3/Pie5*o,to2\ 

30 

25. A process according to claims 18 to 24 wherein the 
polypeptide is chloramphenicol a cetylliansferasa 

26. The use of an expression control DNA sequence as 
35 claimed in any one of claims 1 to 6 for expressing a pro- or 

eukaryotic polypeptide. 



Claims for contracting State AT 

40 

1. A process for the preparation of an expression control 
DNA sequence comprising a coGphage T5 promoter com- 
bined with a DNA sequence which permits tie control of 
45 promoter activity, which process comprises combining a 
coliphage T5 promoter in a manner weO known in the art 
with a DNA sequence which permits the control of promoter 
activity to a functional untL 

50 2. A pro cess according to claim 1, wherein said coBphage 
T5 promoter is the Puzs promoter. 

3. A process according to claim 1 or 2 wherein the DNA 
sequence which permits the control of promoter activity is 

65 an operator. 

4. A process according to claim i. 2 a 3 wherein said 
operator is the lac operator or a functional part thereof. 

60 5. A process according to claim 4, thereby creating an 
expression control DNA sequence characterized in foat the 
lac operator or a functional part thereof overlaps the T5 
promoter sequence. 

65 6. A process according to any one of claims 1 to 5. thereby 
creating an expression control DNA sequence comprising 
the functional part of the nucteotid sequence of fig. 7. 
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7. A process for the preparation of an expression vector 
comprising a DNA portion corresponding to an expression 
control DNA sequence obtained according to any one of 
claims 1-6, which process comprises providing said DNA 
portion and introducing said DNA proton into a convenient 
expression vector. 

8. A process acccording to claim 7. wherein said conve- 
nient expression vector is a piasmid capable of replication in 
gram-negative and/or gram-positive bacteria 

9. A process according to claim 8, wherein said piasmid is 
capable of replication in an E. coli strain. 

10. A process according to claim 8. wherein said piasmid is 
capable of replication in a B. subfflis strain. 

11. A process according to claim 8 or 9. wherein said 
piasmid is a member of the pDS 1 family. 

12. The process of claim 11, wherein said piasmid is 

POS2/PN25X0.1 o2* . 

13- The process of claim 11. wherein said piasmid is 
pDS3/P, e5 x,o.to2\ 

1*. A process for the preparation of a transformant carrying 
an expression vector obtained according to any one of 
claims 7 to 13 which process comprises transforming a 
vector with said expression vector by methods known in the 
art 

15. A process according to claim 14, wherein said vector is 
an E coli strain. 

16. A process acrjording to claim 15, wherein said E. coli 
strain is an E. cofi Ml 5 strain. 



17. A process according to claim u. wherein said vector is 
a 8. subtjfis strain. 

s 18. A process for the manufacture of a pro- or eukaryotic 
polypeptide which process comprises transforming a host 
with an expression vector containing the DNA sequence 
coding for said polypeptide operaoveiy linked to an expres- 
sion control DNA sequence obtained according to any one 

10 of claims 1 to 6, culturing the transformant under appro- 
priate conditions of growth and isolating the derived poly- 
peptide from the culture. 

19. A process according to claim 18 wherein the vector is a 
TS piasmid capable of replication in gram-negative and/or 

gram-positive bacteria. 

20. A process according to claim 19 wherein the piasmid is 
capable of replication in an Ecofi strain. 

20 

21. A process according to claim 19 wherein the piasmid is 
capable of replication in a B.sutjtitis strain. 

22. A process according to daim 19 or 20 wherein the 
25 piasmid is a member of the pDSi family. 

23. A process according to claim 22 wherein the piasmid is 
pDS2/P N2 sX*.t 0 2\ 

30 24. A process according to claim 22 wherein the piasmid is 
pDSa'Pi e5 X/ 0 ,to2*. 

25. A process according to claims 18 to 24 wherein the 
polypeptide is chloramphenicol acetyhransferase. 

35 
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40 


50 


0 


GAAXXCCXCG 


AGGAAXTCCG 


GAXCCGGCAX 


CAXGGXXCGA 


CCAXXGAACX 


50 


GCAXCGTCGC 


CGXGXCCCAA 


AAXAXGGGGA 


XXGGCAAGAA 


CGGAGACCXA 


100 


CCCTGGCCTC 


CGCXCAGGAA 


CGAGTTCAAG 


TACXTCCAAA 


CAATGACCAC 


150 


AACCTCTTCA 


GXGGAAGGXA 


AACAGAAXCX 


GGXGAXXAXG 


GGXAGGAAAA 


200 


CCTGGTTCTC 


CAXXCCXGAG 


AAGAATCCAC 


CXXXAAAGGA 


CAGAAXXAAX 


250 


AXAGXXCXCA 


GXAGAGAAGT 


CAAACAACCA 


CCACGAGGAG 


CXCAXXTTCX 


300 


TGCCAAAAGT 




nfptp n h rh CP 


XAXXGAACAA 


CCGGAAXXGG 


350 


CAAGTAAAGT 


aSjti. Kiti-L IrisJ. M 


J. (s\sAU.AlrJ. LrKr 


GAGGCAGXXC 


XGXXXACCAG 


400 


GAAGCCATGA 


bthb arm fin 




CXCXXXGXGA 


CAAGGAXCAX 


150 


GCAGGAATTT 


HUBS HTHfi f B 
VrAAJHrJ. IrAUA 


/*^i!lil|IJJ I'll 111/'/*^ 


AGAAAXXGAX 


XXGGGGAAAX 


500 


ATAAACXTCT 




CCAGGCGJ.CG 


XCXCXGAGGX 


CCAGGAGGAA 


550 


AAAGGCAXCA 


AlrJ.AlAAlxAJL 


1 CAACrJ. CJLAC 


CAGAAGAAAG 


ACXAACAGGA 


600 


AGAXGCXXXC 


AAGi 1 CI C± G 


cxccccxccx 


AAAGCXAXGC 


AXXXXXAXAA 


650 


GACCATGGGA 


CXJ.X2.KrCl. tstr 


CXI XAGA1 CC 


GGCCAAGCXX 


GGACXCCXGX 


700 


TGATAGATCC 


AGXAAXGACC 


XCAGAACTCC 


AXCXGGAXXX 


GXXCAGAACG 


750 


CTCGGTTGCC 


GCCGGGCGXX 


XXXXAXXGGX 


GAGAAXCCAA 


GCXXGGCGAG 


800 


ATTTXCAGGA 


GCXAAGGAAG 


CXAAAATGGA 


GAAAAAAAXC 


ACTGGAXAXA 


850 


CCACCGXXGA 


XAXAXCCCAA 


XGGCAXCGXA 


AAGAACAXXX 


XGAGGCAXXX 


900 


CAGXCAGXXG 


CXCAAXGXAC 


CXAXAACCAG 


ACCGXXCAGC 


XGGAXAXXAC 


950 


GGCCTTTTTA 


AAGACCGXAA 


AGAAAAAXAA 


GCACAAGXXX 


XATCCGGCCX 


1000 


TTATTCACAT 


XCXXGCCCGC 


CXGAXGAAXG 


CXCAXCCGGA 


AXXCCGXAXG 


1050 


GCAAXGAAAG 


ACGGXGAGCX 


GGXGAXAXGG 


GAXAGXGXXC 


ACCCXXGXXA 


1100 


CACCGXXXXC 


CAXGA GCAAA 


CXGAAACGXX 


XXCAXCGCXC 


XGGAGXGAAX 


1150 


ACCACGACGA 


XXXCCGGCAG 


XXXCXACACA 


XAXAXXCGCA 


AGAXGXGGCG 


1200 


TGXXACGGTG 


AAAACCXGGC 


CXAXXXCCCX 


AAAGGGXXXA 


XXGAGAAXAX 


1250 


GXXXXXCGXC 


XCAGCCAAXC 


CCXGGGXGAG 


XXXCACCAGX 


XXXGAXXXAA 


1300 


ACGXGGCCAA 


XAXGGACAAC 


XXCXXCGCCC 


CCGXXXXCAC 


CAXGGGCAAA 


1350 


XAXXAXACGC 


AAGGCGACAA 


GGTGCTGAXG 


CCGCTCGCGA 


XTCAGGXXCA 


1100 


XCAXGCCGXC 


XGXGAXGGCX 


XCCAXGXCGG 


CAGAAXGCXX 


AAXGAAXXAC 


1450 


AACAGXACXG 


CGAXGAGXGG 


CAGGGCGGGG 


CGXAAXXXXX 


XXAAGGCAGX 


1500 


XAXXGGXGCC 


CXXAAACGCC 


XGGGGXAAXG 


ACXCXCXAGA 


GC 
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10 20 30 40 50 

Xhol 

O CTCGAGGCTG GCATCCCTAA CATATCCGAA TGGTTACTTA AACAACGGAG 
50 GACTAGCGTA TCCCTTCGCA TAGGGTTTGA GTTAGATAAA GTATATGCTG 

Hinfl 

100 AACTTTCTTC TTTGCTCAAA GAATCATAAA AAATTTATTT GCTTTCAGGA 

Hinfl 

150 AAATTTTTCT GTATAATAGA TTC ATAAATT TGAGAGAGGA GTTTAAATAT 

Xhol 

200 GGCTGGTTCT CGCAGAAAGA AACATATCCA TGAAATCCCG CCTCGAG 
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Fig. 4b 1Q 


20 


30 


40 


50 


0 


CCTCGAGGAA 


TTCCGGAXCC 


GGCATCATGG 


XTCGACCAXT 


GAACXGCAXC 


50 


GTCGCCGTGT 


CCCAAAAXAX 


GGGGAXXGGC 


AAGAACGGAG 


ACCTACCCTG 


100 


GCCTCCGCTC 


AGGAACGAGX 


XCAAGXACXX 


CCAAAGAAXG 


ACCACAACCX 


150 


CXXCAGXG.GA 


&GGXAAACAG 


AATCXGGXGA 


XXAXGGGTAG 


GAAAACCTGG 


200 


XXCXCCAXTC 


CXGAGAAGAA 


TCGACCXTXA 


AAGGACAGAA 


XXAAXAXAGX 


250 


XCXCAGXAGA 


GAACXCAAAG 


AACCACCACG 


AGGAGCXCAT 


TXTCXTGCCA 


300 


AAAGTTTGGA 


XGAXGCCXXA 


AGACXTAXXG 


AACAACCGGA 


AXXGGCAAGX 


350 


AAAGTAGACA 


XGGXXXGGAX 


AGXCGGAGGC 


AGXXCXGXXX 


ACCAGGAAGC 


400 


CATGAATCAA 


CCAGGCCACC 


XXAGACXCXX 


XGXGACAAGG 


AXCAXGCAGG 


450 


AATTTGAAAG 


XGACACGXXX 


XTCCCAGAAA 


XXGAXTTGGG 


GAAAXAXAAA 


500 


CXXCTCCCAG 


AAXACCCAGG 


CGXCCXCXCX 


GAGGXCCAGG 


AGGAAAAAGG 


550 


CAXCAAGXAT 


AAGXXXGAAG 


XCTACGAGAA 


GAAAGACXAA 


CAGGAAGAXG 


600 


CTTTCAAGTT 


CXCXGCXCCC 


CXCCXAAAGC 


TAXGCAXXTX 


XATAAGACCA 


650 


TGGGACTXTT 


GCXGGCXXXA 


GAXCCGGCCA 


AGCXXGGCGA 


GATXXXCAGG 


700 


AGCTAAGGAA 


GCTAAAAXGG 


AGAAAAAAAT 


CACXGGAXAX 


ACCACCGXXG 


750 


AXATAXCCCA 


AXGGCAXCGX 


AAAGAACAXT XXGAGGCAXX 


XCAGXCAGXX 


800 


GCXCAATGTA 


CCTATAACCA 


GACCGTXCAG 


CTGGAXATTA 


CGGCCTTTTX 


850 


AAAGACCGXA 


AAGAAAAAXA 


AGCACAAGXX 


XTATCCGGCC 


XXXAXXCACA 


900 


XXCXXGCCCG 


CCXGAXGAAX 


GCXCAXCCGG 


AAXXCCGTAX 


GGCAAXGAAA 


950 


GACGGXGAGC 


TGGXGATATG 


GGAXAGXGXT 


CACCCXXGXF 


ACACCGXXXX 


1000 


CCAXGAGCAA 


ACXGAAACGX 


XXXCAXCGCX 


CXGGAGXGAA 


XACCACGACG 


1050 


AXXXCCGGCA 


GXXXCTACAC 


AXATAXXCGC 


AAGAXGXGGC 


GXGTXACGGX 


1100 


GAAAACCXGG 


CCXAXXXCCC 


XAAAGGGXXT 


ATTGAGAAXA 


XGXXTXTCGX 


1150 


CTCA GCCAAT 


CCCXGGGXGA 


GXXXCACCAG 


XXXXGAXXXA 


AACGXGGCCA 


1200 


AXAXGGACAA 


CXXCXXCGCC 


CCCGXXXXCA 


CCAXGGGCAA 


AXAXXAXACG 


1250 


CAAGGCGACA 


AGGXGCXGAT 


GCCGCXGGCG 


ATXCAGGXXC 


AXCAXGCCGT 


1300 


CXGTGATGGC 


XXCCAXGXCG 


GCAGAAXGCX 


TAAXGAAXXA 


CAACAGTACX 


1350 


GCGAXGAGXG 


GCAGGGCGGG 


GCGXAATXXX 


XXXAAGGCAG 


XXAXXGGXGC 


1400 


CCXXAAACGC 


CXGGGGXAAX 


GACXCXCXAG 


AGACXCCXGX 


XGAXAGAXCC 


1450 


AGXAAXGACC 


XCAGAACXCC 


AXCXGGAXTX 


GXTCAGAACG 


CTCGGXXGCC 


1500 


GCCGGGCGXX 


XXXXAXXGGX 


GAGAAXCXCX 


AGAGC 
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TAATTGCGTT 


GCGCXCACXG 


CCCGCXXXCC 


50 


AGXCGGGAAA 


CCTGTCGTGC 


CAGCTGCATT 


AAXGAAXCGG 


CCAACGCGCG 


100 


GGGAGAGGCG 


GTTTGCGTAT 


TGGGCGCCAG 


GGXGGTXXTX 


CXXXXCACCA 


150 


GTGAGACGGG 


CAACAGCTGA 


TTGCCCTTCA 


CCGCCXGGCC 


CXGAGAGAGX 


200 


TGCAGCAAGC 


GGTCCACGCT 


GGTXTGCCCC 


AGCAGGCGAA 


AAXCCTGXXT 


250 


GATGGTGGTT 


AACGGCGGGA 


XAXAACAXGA 


GCXGXCXXCG 


GXAXCGTCGT 

yjr X AX w vrX *-/ V* 


300 


ATCCCACTAC 


CGAGATATCC 


GCACCAA CGC 


GCAGCCCGGA 


CTCGGTAATG 

isx w wx Anx V 


350 


GCGCGCATTG 


C GCCCA GCGC 


CAXCTGATCG 


TTGGCAACCA 


GCATCGCAGT 


400 


GGGAACGATG 

\Jr v vAA lr vA* V 


CCCTCATTCA 


GCATXTGCAX 


GGTTTGTTGA 


AAACCGGACA 


450 


TGGCACXCCA 


GTCGCCXTCC 

V* X v v v X * w 1 v 


CGTTCCGCTA 

\j> v -X. -A. v-* W X 


TCGGCTGAAT 


TTGA TTGCGA 

X X VAX X VV VA 


500 


GTGAGATATT 


tatgcca err 

X AX W w A V w w 


AGCCAGA CGC 


AGACGCGCCG 




550 




GCTAACAGCG 




GTGA Crr A A T 


GCGA CCA GA T 

w WA VAi 


600 

U W V 








VUvavaaoaI 


jj fl yjj fTG7"PfZ 

aaXaOX (/XX 1/ 


U J u 


iu LrO'CrX CrX OX 


ggtca a a ciA r 


A7TAAGBAAT 


Anl/l/U^UlTAA 


VAX X a LrX VUA 


7nn 

/ uu 




ilC/nLrviUiX Crtr 


Wax tOx CrCrX u 


xix OUiivUUv/i 


XAlrxXAAX CrA 


750 


TCa GCCCA CT 


LrH O LrC» trx X CrO 


trO CrA Cr aa vaI 


x O-xtrO/iOOCrO 


f/tftwpmm nnnn 

LlrliXildl/Alr 


800 


GCTTCGA CGC 


CGCTTCGTTC 


TACCAXCGAC 


ACCACCACGC 


TGGCACCCAG 


850 


TTGATCGGCG 


CGAGAXXXAA 


XCGCCGCGAC 


AAXXXGCGAC 


GGCGCGXGCA 


900 


GGGCCAGACT 


GGAGGTGGCA 


ACGCCAAXCA 


GCAACGACXG 


XXXGCCCGCC 


950 


AGTTGXTGTG 


CCACGCGGTT 


GGGAATGXAA 


TTCAGCXCCG 


CCATCGCCGC 


1000 


TTCCACTTTT 


XCCCGCGXXX 


XCGCAGAAAC 


GXGGCXGGCC 


XGGXXCACCA 


1050 


CGCGGGAAAC 


GGTCTGATAA 


GAGACACCGG 


CATACTCTGC 


GACAXCGTAX 


1100 


AACGTTACTG 


GTTTCACATT 


CACCACCCXG 


AAXXGACXCX 


CXXCCGGGCG 


1150 


CTATCATGCC 


ATA CCGCGAA 


A CGTTTTCCA 


CCATTCGATG 


GTGTCAA CGT 

vX w X vAAV vX 


1200 


AAATGCATGC 


CGCTTCGCCT 


XCCCGCGCGA 


AXXCCXCGAG 


GAATTCCTCG 


1250 


AACCGCCTCC 


XGCAACTCXC 


XCAGGGCCAG 


GCGGXGAAGG 


GCAAXCAGCX 


1300 


GTTGCCCGTC 


TCGCTGGTGA 


AAAGAAAAAC 


CACCCXGGCG 


CCCAAXACGC 


J. 0 3 U 


DUB rTTTT'Tf' 


TCCCCGCGCG 


XXGGCCGAXX 


CAXXAAXGCA 


GCXGGCACGA 


1400 


CAGGTTTCCC 




G OXxCrGACrx CrA 




AjL x A AX CrX Cr A 


1450 


GTTAGCTCAC 


TCATTAGGCA 


CCCCAGGCXX 


XACACXTXAX 


GCXXCCGGCX 


1500 


CGTATGTTGT 


GTGGAATTGT 


GAGCGGAXAA 


CAAXXXCACA 


CAGGAAACAG 


1550 


CTATGACCAT 


GATTACGGAT 


XCACXGGCCG 


XCGXXXXACA 


ACGXCGXGAC 


1600 


TGGGAAAACC 


CTGGCGTTAC 


CCAACXXAAX 


CGCCXXGCAG 


CACAXCCCCC 


1650 


CTTCGCCAGC 


TGGCGTAATA 


GCGAAGAGGC 


CCGCACCGAX 


CGCCCXXCCC 


1700 


AACAGTTGCG 


CAGCCTGAAT 


GGCGAAXGGC 


GCXXXGCCXG 


GXXXCCGXCG 
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17 50 AAGCXXGGCG ACATTTTCAG GAGCTAAGGA AGCTAAAATG GAGAAAAAAA 
1800 TCACTGGATA TACCACCGTT GATATATCCC AATGGCAICG XAAAGAACAX 
1850 TITGAGGCAT XXCAGXCAGX XGCXCAAXGX ACCTATAACC AGACCGTTCA 
1900 GCTGGATATT ACGGCCTXTT TAAAGACCGT AAAGAAAAAX AAGCACAAGT 
1950 TTTAXCCGGC CTTTATTCAC ATXCTTGCCC GCCTGATGAA TGCTCAXCCG 
2000 GAAXXCCGXA TGGCAATGAA AGACGGTGAG CXGGXGAXAX GGGAXAGTGT 
20 50 TCACCCTTGT TACACCGTTT TCCATGAGCA AACXGAAACG XXXXCAXCGC 
2100 TCTGGAGTGA ATACCACGAC GATTTCCGGC AGTTTCTACA CAXATATTCG 
2150 CAAGATGTGG CGTGTTACGG TGAAAACCTG GCCTAXTTCC CTAAAGGGTT 
2200 TATTGAGAAT AXGTTXTXCG XCXCAGCCAA XCCCXGGGXG AGXXXCACCA 

22 50 GXTTTGATTT AAACGXGGCC AAXAXGGACA ACXTCTTCGC CCCCGITXXC 
2300 ACCATGGGCA AAXAXXAXAC GCAAGGCGAC AAGGXGCXGA XGCCGCXGGC 

23 50 GAXXCAGGXX CAXCAXGCCG XCXGXGAXGG CXXCCAXGXC GGCAGAAXGC 
24-00 XXAAXGAAXX ACAACAGXAC XGCGAXGAGX GGCAGGGCGG GGCGXAAXXT 
2150 XXXXAAGGCA GXXAXXGGXG CCCXXAAACG CCXGGGGXAA XGACXCXCXA 
250 0 GAGC __ 
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